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" , .to admit things not visible to the gross creatures that we are is, in my opinion, to show a 
decent humility, and not just a lamentable addiction to metaphysics". J. S, Bell (1) 


1. INTRODUCTION 


Within the tradition begun by Bell(), Greenberger, Horne, and Zeilingerl*) (henceforth GHZ) have 
recently sketcheda simple proof that local deterministic hidden-variable theories are inconsistent with 
the strict correlations predicted by quantum mechanics (QM) for the following new 
Gedankeaexperimeat A spin-1 particle, initially in the state m=0 in a magnetic field along the z- 
axis, decays into two spin-i particles moving in opposite directions along this axis; and then each 
subsequently decays into two spin-1/2 particles, one pair (A and B) moving in the +z direction and the 


"Teis a pleasure to dedicate this paper to the dete) of Professor John S. Bell whose brilliant work 

over the last three decades did so much to clarify the foundations of quantum mechanics. 

1 Depertment of History and Philosophy of Aeiease, Cambridge University, Free School Lane, Cambridge CB2 3RH 

UK. 

2 Faculty of Philosophy, Cambridge University, Sidgwick ite, Bidgwick Ave., Cambridge CB3 ODA UK. Ne 7 
3 All abbreviations we shall use in this paper are collected in an appendix ot the end, with 1a definitions, for easy/— 
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other (C and D) inthe -z direction. After all decays have occurred, and the four particle system is left 
inthe state: 


|'¥ > = 1/72 (j0,(A) = +1> @ |0,(B) = +1> @ |0,(C) = -1> @ |o,(D) =-1> - 
]0,(A) = -1> @ |0,(B) = -1> @ |0,(C) = +1> @ Jo,(D) = +1>) 


the spin of cach particle is measured at spacelike separation from the others using spin measurement 
apparatuses, or ‘spin-meters'4 (for short), with orientations a, b, c, and d in the x-y plane 
perpendiculartothe particles’ line of flight. Spacelike separation can be arranged by requiring that 
the particles in each of the pairs (A, B) and (C, D) have different masses, thus making them leave the 
second spin-1 decay with different velocities. Then, after the particles in a given pair separate 
sufficiently along the z-axis, spin-meters can be placed intheirrespective pathstointerceptthem>. 


The expectation value of the product of the four measured spins along directions a, b, c, and 
d,ie.<¥\(@ (A)- ao (B)- bo (C)- cXo (D)- d)M>, caneasilybecomputed using the 
relations 0,|0,=+/-1> = |0,=-/+1>; Oy|0,=+/-1> = +/-i{0,=-/4+1>; and Euler's equation to simplify. 
This yields -cos(0, + Op - 9c - Op), where 0, [respectively,Op, Oc, and Op] is the angle between a 
[b, ¢, and d) and some arbitrary fixed direction in the x-y plane. To preserve continuity with the 
Bell's theorem literature, we shall refer to this expectation value as a ‘correlation coefficient between 
the four spin observables. However, untike the Bell two-variable Gedaakeaexpeniment, it has no 
direct connection with standard definitions of correlation between greaterthan two variables). 


For this Gedaakenexperimen{HZ start with a local deterministichidden-variableframework 
4 We use this generic term rather than ‘Stern-Gerlach davioes’ because the latter are az pranejpi incapable of messwing 
the spin of free elementory particlest9). 
5 To achieve the requisite spatial separation withovt restriction to the z-axis, one might propose that either: (9) a Geld 
De sat vp to deflect the (A,B) [[C,D)] particles xZjubédy away from each other after the second spin-1 decay; or (}) only 
those particle pairs which, of their owa accord, leave the second decay in siguit/y differeat directions be considered. 
The advantage of (2) and (b) is that each eliminates the need to move the spin-meters into place before each 
measwrement trial, so that % would be easier to actually perform the experiment. However, since [> presupposes that 
all porticles move along the 2-tcis, one would then need to check that with a swiiciently small devistion from this axis, 
spin-orbit coupling effects will not significantly alter the predictions prescribed by |'¥>. In particular, if the strict 
correlations that section 3's proof uses are lost, one would have to twn to S's nonstrictly correlated version of the 
proof. Alternatively, one could consider measwrements on other quantum states (with 3 or more correlated spin-1/2 
particles) that do not presuppose a z-axis restriction, but nevertheless predict strict comelstions. Since writitig this 
~paper, Mermin(‘) has elegantly shown that many such stares exist for which an informal version of 3's strictly 
correlated aonlocality proof goes through; and Greenberger w 2/ (5) have studied analogues of these states not involving 
spin whieh are experimentally realizable via ‘three-particle interferometry’. 
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and sketch how a contradiction can be derived between the two-valued ‘result functions’ required by 
this framework, and the extremal values, +1, of the correlation coefficient, which will obtain when 
84 + Og - 8c - 8p = 0 or +M. The contradiction is not just statistical (as in Bell's theorem) but 
‘algebraic’, in the sense that values cannot be consistently assigned to all measurement results 
(determined by the result functions) given the wide range of settings 9,4 through 9p for which |¥S 
predicts strict (i.e., +1) correlations. This result, if correct, is surprising in light of the fact that 
hidden-variabletheories can easily (and consistently!) reproduce all the strict correlations of simpler 
quantum systems like the spin-1/2 singlet state (cf. Ref. 1, pp. 16- 17). 


Interestingly, because they invoke only strict correlations, GHZ appear to claim some 
philosophical generality over Bell's theorem based on the fact that their proof applies under just those 
conditions that permit inferences, from certain classical ‘locality’ and ‘reality’ assumptions, to the 
existence of Einstein-Podolsky-Rosen (EPR) (deterministic) ‘elements of reatity'(?-!3), For given 
[¥>'s strict correlations, knowledge of any three measurement results allows one to predict the fourth 
with certainty; and givena ‘no disturbance’ locality assumption, one can then apply the well-known 
EPR sufficient condition so as to assert an element of reality pertaining to the fourth particle. 
Because of this, GHZ dub strict correlations as ‘the super-classical' case; and they claim that because 
their proof restricts itself just to this case, itis more important than Bell's ‘from the point of view of 
reality’. GHZ's idea seems to be that they show the nonexistence of EPR elements of reality 
precisely in those cases where their existence is most plausible. But in 4 we shall disagree with this 
emphasis, arguing that, from the standpoint of Bohr's response to EPR, the existence of elements of 
reality is neither more nor fess plausible in the context of GHZ's proof, than in the context of Bell's, 
which doesn't confine itself solely to the superclassical case. 


In any case, GHZ's claim that their proof goes ‘beyond Bell's theorem’ is not justified until 
their result is proved under assumptions at least as weak as those adopted in the most general version 
of Bell's proof (Ref. 1, Ch. 7). In particular, the latter proof does not rely on determinism - either in 
the evolution of hidden-variables prior to measurement, or in the sense that they encode intrinsic 
possessed values for observables; nor on ‘faithful measurement’ - the assumption that measurement 
results are solely a product of variables associated with the measured system, i.e. apparatus hidden- 
variables are causally irrelevant. In contrast, GHZ do not consider apparatus hidden-variables which 
might be causally relevant to the measurement results; and they explicitly assume that the particle 
variables at the time of measurement are uniquely determined by those which exist back at the first 
spin-1 particledecay®, ruling out evolutionary indeterminism. They also assume that the variables at 
the time of measurement uniquely determine each measurement result, thus presupposing intrinsic 


6 GHZ stridvte this asswmption to Bell (Ref. 1, Ch. 2). However, there Bell is neviral as to whether the hidden- 
voviable srguments of his result Functions represent the state of the porticles when leaving the sowve, or @ some Inter 
‘suitable instant’ (ef. p. 15). 
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possessed values or elements of reality’. 
" Ge, GHZ justify this last assumption by assuming ‘locality’ (in the sense that results 
are ‘undisturbed’ by distant measurements) and using |‘¥>'s strict correlations to argue, in EPR 
fashion, for the existence of elements of reality (just as Bell did in his first nonlocality proof using 
the singlet state's strict correlations - cf. Ref. 1, Ch. 2, and Bell's comment on p. 157, note 10). 
However, itis surely desirable to seek a less informal, more cogent, version of EPR's argument to 
supplement GHZ's - particularly in light of the almost unanimous agreement among recent 
commentator$12-15) that the EPR argument needs to assume that the particles are ‘separable’ - a 
notion not captured (at least unambiguously) by EPR's (and GHZ's) talk of ‘no disturbance’ between 
each particle and the distant apparatuses. Our generalization of the GHZ argument to a stochastic 
hidden-variables framework (i.e. a framework in which elements of reality are not assumed) shall 
begin, in effect, with a probabilistic formalization of EPR's argument in which an assumption will 
appear that plays roughly the same role as separability (though, in general, is weaker than it - or so 
we shall argue). We will also use EPR-type reasoning in a novel way to show that independence of 
measurement result probabilities from apparatus hidden-variables follows from our proof's locality 
assumptions and |‘! >'s strict correlations. Finally, we shall define the various hidden-variables that 
may be causally relevant so as not to presuppose evolutionary indeterminism. Through all of this, w 
intend to show that GHZ’s proof can indeed be made as general as the most general version of Bell) 


proof (Ref. 1, Ch. 7). 


The overall plan of our paper is as follows. In 2, we broadly characterize the class of 
stochastic realist theories that aim to reproduce |‘¥>'s correlation statistics; and we motivate the 
locality principles to be used in a way distinct from previous authors, taking our cue from recent 
criticisms of these assumptions, In 3, we give our more general proof of GHZ's result adopting an 
explicitly measure-theoretic approach. It turns out that, within this approach, the details of GHZ's 
proof of contradiction (between |"? >'s strict correlations and the various result functions) need to be 
modified to produce a valid proof. In 4, we briefly compare our proof to other algebraic proofs of 
nonlocality, and then argue that itdoes not improve on Bell's proof with respect to Bohr's response 
to EPR. However, we argue that it does improve on Bell's proof (particularly, the most general 
version of that proof, which doesn't invoke any strict correlations - cf. Ref. 1, Ch. 7)in some other 
respects not discernable from GHZ's own treatment. Finally, in5, we show that a modified version 
of our proof, of roughly the same strength as Bell's, goes through without invoking experimentally 
unrealizable strict correlations. In fact this modified proof leads to a new ‘quasi-algebraic’ locality 
inequality susceptible to experimental test. We conclude that, on balance, our proof compares 
favourably to other nonlocality proofs at the level of Gedaakeaexperiment but it remains an o 
question whether our proof can surpass Bell's in adaptability to experiment. 


2. THE PROOF'S ASSUMPTIONS IN THE CONTEXT OF A GENERIC REALIST 
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Broadly speaking, realism is the view that physical systems: (a) exist independently (in particular, 
exist whether or not they are about to enter measurement interactions); and (b) possess intrinsic 
properties, or states, which (at least partially) account for measurement results and/or their statistics. 
(‘At least partially’ allows fora contribution to measurement statistics fromthe apparatus’ own state. ) 
Such independently existing states we shall henceforth call 'existents' (following Shimonyt!6)), 
departing from the traditional term ‘hidden-variables'. Our reasons are two-fold: as emphasized by 
Bell (Ref. 1, p. 92), 'hidden-variable' wrongly suggests that aspects of a system's state can never be 
revealed through measurements; and that term prohibits, #rvon! the states attributed by QM itself 
from being complete objective descriptions which do not need supplementing by as yet unspecified 
(‘hidden’) variables. 


In our Gedunkenexperimeat consider a particular quadruple of spin measurements 
undertaken with settings 8,4, 9p, 9c, and Op which are fixed (not necessarily by ‘free-willed’ 
experimenters) /vst prvor to the onset of each measurementinteraction. Let the existent A denote the 
complete objective state (or set of states) in the emare causal history of the four particles prior to 
these four measurement interactions. (The term ‘causal’ allows either stochastic or deterministic 
causation.) This very general characterization of A allows for the possibility that the particles’ state: 
(a) evolves indeterministically in time prior to the measurements; (b) fails to encode determinate 
values for all (or any) observables; (c) failsto be spatially (or temporally) separable into aconjunction 
of determinate states pertaining to each of the four particles (or to each of the times prior to 
measurement); and (d) failsto satisfy the Markov property that the probability of an event (e.g. , the 
registration of each measurement result) depends only onthe state(s) immediately preceding itintime 
and not also on earlier states (e.g. of the measured, or measuring, system). (For a brief discussion of 
theories in which (d) holds, cf. Ref. 17, and references therein. ) 


Similarly without loss of generality, letthe existent jtal|-s, [lc, [Lp] denote the complete (set 
of) objective state(s) of the A[B, C, D] spin-meter and its associated particle detection apparatus, in 
their entire causal history prior to the A [B, C, D] measurement. Each pafptp, Hc, ftp] will be 
assumed to include themacroscopic orientation 04 [Op, 8c, Op] fixed justdefore the interaction with 
the A [B, C, D] particle, however the latter is an important parameter which we shall refer to 
explicitly. 


The spacetime regions with which these various existents would be associated for a typical 
'4-measurement' are depicted in figure 1, where we have assumed that the (A, B) {(C, D)] 
apparatuses intercept their respective particles from opposite sides of the z-axis in the x-z plane in 
order to minimize the possibility that their causal histories overlap. (More on this point, and other 
details of figure 1, below.) 
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Figure | - Spatio- Temporal Regions for Existents in a Generic Realist Theory 


Let P(€,4,€p,€c,€pA. LA LB, Hop. 04,85,8c,8p) be the joint probability that the four 
measurement results€, through fp (each = +1, in units of h/2m) are obtained’, given the existents A 
and jl, through |p, and the settings 9, through 9p. Tominimizenotationalcomplexitywe define € 
= <£,,€p,€c,Ep>, [lL = <a, Mp. Mc,pp>, and @ = <0,4,0p,0¢,8p>, so that the this probability reduces 
to just P(e/A, 11,6). 


7 The ‘results’ €a through Ep cab represent states of the apparatus ‘pointers’ and/or the particles after the 4-measwement 
is completed. Despite SchrOdinger’s cat, ow assuming these results are (macroscopically) distinct from eachother swely 
does not compromise the generality of ow proof. For the realist will resist the superposition of macroscopically 
Ustinowichable states in favow of e.¢. cnontansous wave packet reduction theories 28). or manv-worlde theortect 19). 
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In general, for fixed @ the existents will vary from one 4-measurement to another for two 
reasons. First, because their preparation is assumed to be (at least at present) uncontrollable. 
Second, because they are defined as states associated with a spacetime region outside the four 
measurementinteractionregions and their future lightcones, and the temporal ordering (in the lab 
frame) of these latter regions may vary from one 4-measurementto another. 


Thus for consistency with the 4-measurement correlation coefficient dictated by |‘¥>, the 
realist must recover it, in particular its extremal values, by averaging the corresponding correlation 
coefficient prescribed by the existents, using some density distributions representing ignorance about 
which existents obtain ona particular occasion: 


Strict ComelationR ecovery lntegral(SCRI): 
JaUa(Eett Pre. p.0))o.A.0)du]oqveyer =41 v0 © o, 


In SCRI, I, denotes the product €4€g&c€p of the elements of the quadruple ¢; ©, = {0/04 + Op - Oc 
~ Op = +N}; and Q. is defined similarly with 0 replacing +m. Also A is the set of all possible A 
compatible with at least one setting quadruple in © (where © is the union 6,0); and M is the set of 
all possible jt compatible with at leastone possible A and setting quadruplein@. Measurable subsets 
of A and M and measures on them must be defined; and as usual we will suppose, only for 
simplicity, that our probability measures are absolutely continuous with respect to the basic measure, 
and so given by densities. But 4 prrar’ we impose no substantive restrictions on the sets A and M, 
ortheir measures. In particular, we allow the existent distributions in SCRI to depend upon the meter 
settings 0; and on other controllablemacroscopic parameters associated with the source and detectors 
(which we assume to be fixed VO € ©, and thus omit from our notation). 


Further, without loss of generality we can regard the set M as the Cartesian product 
MaXMpXMcXMp, where My is the set of all ji,, such that ji, is compatible with atleast one possible 
triple of apparatus existents [Lp.c,p (not necessarily compatible with all such triples!); and Mp - Mp 
are defined similarly. Of course this means that, in general, M may include quadruples, {1, that are 
incompatible with eachother. However, whenever a certain combination of apparatus existents is 
incompatible, the density, p(j/A,0), of such a combination must also be zero, leaving SCRI's 
integral unchanged from what it would have been had we not assumed M to be a Cartesian product. 
So our choice to define M as such a product really is just a simplifying convention with no physical 
implicationsforthe realist's recovery of QM's strict correlations. Further, when SCRI's integral is 
evaluated,it doesn't matter how probabilities like P(e/A.,) 0), that may happen to be conditional on 
incompatible combinations of existents, are defined, or what values these take - for, because 
incompatible combinations have zero density, our choice of how to define them (or what values they 
have) will simply never show up after evaluating any integral. In section 3's proof we shall exploit 
this freedom, and the freedom (without loss of generality) to regard [4 B,c,p a8 independentvariables 


News 
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throughout M (which is entailed by the latter being a Cartesian product). 


We turn now to the assumptions which our proof will show the realist must reject given the 
condition of recovery expressed in SCRI. There are only three, the first two being conditions about 
probabilities - specifically, about measurementresults being statistically independent of distant facts, 
bethey results, settings, or apparatus existents:- 


OutcomeFactorability (OF). P(e/A,p.8) = P(ea/A,p.8) PlepA,p.8) P(ecA,p.8) P(epA,p.6) 
Outcome Locatity (OL): P(EqAa}L 8) = Prea/A,pta.8a) ; P(epA,pt.6) = P(epA, tp. Op) 
P(E/A,L.8) = PlEc/A, [c.8c) 1 P(EpA. p68) = PlEpA. pp. 8p) 


In OF and OL we follow the axioms of standard probability theory so that P(e,4/A,p,0) = 
dep ety! P(e/A,p.,8), etc. Further, the notation used on the right-hand sides in OL denotes the 


common value of the left-hand side as the existents and settings which do not appear on the right vary 
throughout M and ©. 


OF and OL parallel two similar ‘locality’ assumptions commonly adopted in Bell's 
theorenf20); so we shall not rehearse their standard motivation in detail here - suffice it to say that 
theyareplausible giventhat: 


(I) Each measurement is undertaken at spacelike separation from the other three; 

(II) The causal history of each measurement device is sufficiently disentangled from the 
other three and from the particles inthe backward lightcones of the measurement 
events; and 

(IH) Spacelike causation does not exist. 


OF isthe assumption that we alluded toin 1 which plays the role of EPR's tacit ‘separability’ 
assumption in their argument. However, OF isin general weaker than separability (which we do not 
need to assume), if we adopt the most natural definition of the latter as asserting that the composite 
state of the four particles, A, is reducible to separate states for each particle individually which 
determine their composite state. True, inthe particular case where we assume the state vector of QM 
exhaustively describes the state of the particles, A, OF and separability coincide. For it is a well- 
known theorem of QM that conditions like OF holdif, and only if, the quantum state is a product (or, J 
‘factorable') state, i.e. a state which permits the individual states of the component systems to be Ne 
‘disentangled' (to use Schrodinger's term). So, for example, both OF and separability fail if A is just 
(>. 


However in the case of the de Broglie-Bohm causal interpretation(#!), which does not take 
QM's state ascriptions to be exhaustive, OF holds while separability fails. In its simplest form, their 
interpretation posits a A comprising the positions of our four spin-1/2 particles and |Y>, which 


ri 
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generates afield collectively guiding thefourparticles. But since |‘¥> is manifestly nonfactorable, A A ') 
is zot separable into parts which pertain to, or act upon, each particle separately. (In fact, this > jar 
nonseparability is exactly what forces their theory to violate OL once |'¥> becomes modified by its Ne 
interaction with the spin-meters- cf. Ref. 1, p.11.) On the other hand, A fixes measurement results 

deterministically, which implies that OF must hold. So the causal interpretation providesanexcellent 

example where OF holds while separability fails, demonstrating that they are not the same®, 


In addition to motivating OF and OL, (1)-(IIT) motivate the following assumption concerning 
theparticles' existentdistribution: 


Particle Locality PL): 
pA/d)=p(A) VA0EO 


Here again, the notation used on the right-hand side denotes the common value of the left-hand side 
as the settings which do not appear on the right vary throughout ©. PL permits us to define setting- 
independentp(A)-measures V subsets S of A: m,[S] = Jgp(A)AA. 


We now define 'Total Factorability' (TF) as the conjunction of OF and OL. And we define: 
Arp = {A|VLEM, V8EO , Ve: TF holds} 


Existent sets such as Arp, over which ‘locality’ assumptions are true, are not usually explicitly 
invoked in proofs of nonlocality. But we need Arp in order to make plain in 3 why GHZ's own 
proof no longer goes through in our more general measure-theoretic framework; and to facilitate a 
clear and complete statement of our theorem - which will be that PL and SCRI require the mj- 
measure of Arpto be 0. In effect: with measure 1, TF must fail. 


But before proving the theorem it is worth considering two important criticisms of 
assumptions like ours above; though they have been raised in the context of Bell's theorem, they are 
also applicable here. Our aim is to rebut these criticisms, and so keep the above assumptions 
plausible, by carefully defining the particle and apparatus existents which appear in our assumptions 
(cf. figure 1). 


The first criticism, raised by Jarrettl#), questions the stochastic independence of 
measurementresults when their probabilities are conditioned only on ‘initial’ existents, call them jp 
and Ag, encoding the state of the particles and apparatuses upto some time well before measurement 
(e.g. atthe time of the first spin-1 decay). Spelling out this criticism will help us justify why our OF 


5 Note that ow sense of separability is ax Howerl's (Ref. 22, p. 230, note 13) who chooses to define separability of 
the A - D particle states in a way which would render this notion aywicdiat to OF. Tha case of the cavsal 
interpretation shows how Howart's terminology is misleading. 
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conditions on ft and A (which encode states right upto measurement) rather than jlo and Ag. For 
brevity, cal! the disputed assumption OFo, for ‘OF with probabilities conditional on (j19,Ao)'. And 
let Vp = (U,A) and V = ([1,A), so that we can discuss both the apparatus and particle existents at 
once. Thenthe criticism is that OF cantrivially fail for realist theories in which the existents evolve 
indeterministically upto measurement (specifically, theories which predict that for at least two V- 
existents (compatible with Vo), say V' and Vv", 1>P(V'/vo,8)>P(v"No,8)>0). For conditioning ona 
distant result can alter the probability of a nearby result by giving information about how Vo 


evolved?. ez 


Admittedly, this criticism holds only if the information is probabilistically relevant to both 
results. This means the theory must either: prohibit the separation of each V-existent into parts, V4 
through Vp, causallyrelevantto eachmeasurementinteractionseparately; or permit this, but allow Va 
through Vpto causally interact with each other, or with distant results, in ways already ruled out by 
our motivating assumptions (I)-(1II). So one may wonder why we circumvent this criticism of OFp 
by assuming the weaker OF, rather than additional separability and locality assumptions. 


One reason is that OF's conditionalization on more than just the existents at the time of the 
first decay avoids some recently proposed nonquantum mechanical counterexamples"); for 
details, cf. Ref. 28. But our main reason is that, although additional locality/separability 
assumptions are classically plausible, they unnecessarily weaken our theorem's conclusion. Thus in 
figure 1 we have not separated the particles’ existent, A, into parts pertaining to the individual 
particles. So we allow the particles tobe ‘entangled’, asin the special case where A = |'¥>: after all, 
they (unlike the apparatuses) directly interact with eachother prior to measurement. Further, if a 
realist theory does ascribe separate states to each particle, figure 1's A. allows for the possibility of 
causal influences between these states at any time prior tothe 4-measurement. Indeed, it even allows 
for states of the source at the time of measurement (i.e. well after the first decay) to influence € and 
the particles: for these states will lie within figure 1's central ‘diamond’ bounded by null lines 
between the B and C measurement regions. Allowing A to ‘spread into’ this region, and into the two 
other diamonds between the A/B and C/D measurements, does not affect our proof, nor the 
plausibility of PL. (As long as A erc/udes each diamond's future boundary (composed of 2 null 
lines), PL will not fail by part of 2 being in the forward lightcone of the fixing of a meter setting.) 


The second criticism, originating in a debate between Shimony e¢ 2/29) and Belt(9), centres 
around PL. In the most general version of his theorem (Ref. 1, Ch. 7), Bell divides existents into: 
those in the overlap of the backward lightcones of the measurement events; and those in the remainder 


9 Av similor sygument has Deen widely wed to motivate OF (and even OFo!) om the grounds that its probabilities are 
conditional on «d/ tha details that might be cavsally relevant to the measurement results (ef. Refs. 1, p. 54; 24, p. 206; 
and 25, p. 579). 
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of these backward lightcones. The difficulty is that Bell takes the overlap existents to be the total 
states of the spacetime region so that they contain information not restricted to the particles’ initial 
State and preparation, and can hardly fail to influence the eventual setting choices 8. So his version 
of PL, which pertains to 2//of these overlap existents, isimplausible. To avoid this, Shimony eva/. 
opt for two hypotheses: that the set of overlap existents causally relevant to the measurement results € 
are just those which are intrinsic to or in the immediate spatio-temporal vicinity of the particles; and 
that their distribution can be regarded as a marginal distribution averaged over the extra overlap 
existents that Bell includes, The idea-of the first hypothesis seems to be that if one is more specific 
about which overlap existents are causally relevant to €, then a backwards causal entanglement 
betweentheseexistents and 0 is less credible. The second hypothesis weakens this even further by 
allowing that factors causally relevantto® inthe overlap do influencethe particles’ existent, but that 
when averaged out they leave its distribution independent of 8. Although these hypotheses restrict 
the scope of the theorem, Shimony ef a/ claim that they increase its plausibility since they are 
methodological hypotheses 'no stronger than one seeds for experimental reasoning generically’ (cf. 
Refs. 31 and 32 for further arguments which supplement theirs). 


We propose to follow Shimony eta/ Although we differ from them in our defining 
existents right upto measurement, we take our A-existents to be intrinsic to the particles - which is 
why, in figure 1, A's region does not completely fill any measurement event's backwards lightcone, 
nor their intersection. (Of course, exactly which existents are ‘intrinsic’ to the particles depends upon 
the realist theory at issue, so in this respect figure 1's A provides only a guide. But, ata minimum, A 
shouldinclude the classically expected trajectories of the particles, whichfigure 1 indicates. ) Further, 
we regard (A) as averaged over other factors relevant to @ in the overlap of the four backward 
lightcones 


Asimilar point can be made about the other existents causally relevant to € being intrinsic to 
the apparatuses (as with our |t), cather than total states of spacetime regions; and treating p(jL/A,9) as 
a distribution averaged over these states. However, since in 3's strictly correlated proof we do not 
need to assume that the apparatus distribution is independent of 8, there is not as much motivation to 
do this. But in both Bell's and our nonstrictly correlated proofs (discussed in4 and 5, respectively) 
there is such motivation, since there P(jl/A,6) is assumed to factor into a product of distributions 


pertaining to eachapparatus existent and setting separately. 


Having accommodated the Jarrett and Shimony efa/ criticisms, figure 1's spacetime diagram 
turns out to be very different from Bell's (cf. Ref. 1, p. 55, fig. 3). Nevertheless we shall retain 
figure 1's existents even when discussing Bell's proofsin4. (For an alternative approach, Ref. 28 N er 
surveys ways in which, adopting Bell's method of taking existents to be totalstates of spacetime _ 
regions, his assumptions can be made as plausible as possible. ) VY 


It ts worth discussing one final doubt Shimony (Ref. 33, p. 226(note)) has since advanced 
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as to whether the existents associated with the 4 apparatuses canbe /u//ycausally disentangled in any 
individual measurement trial. For even if an apparatus existent is sufficiently intrinsic (dispelling the 
concerns of Shimony ef a/), Shimony argues that it ‘is an enormously complicated and detailed 
compendium of information’ that seems bound to leave ‘some small trace’ upon the others, even 
without any spacelike causation. If Shimony’'s doubt is correct, this would have the effect of making 
the apparatus existents so strongly entangled, that no two quadruples of existents, 1 and |’, 
appearing in different measurement trials could share a common A, B, C, or D apparatus existent 
without at least one of them having zero measure (because it would have to be an incompatible 
combination of existents!). And this would render OL physically empty. 


For example, consider the equation P(E,/A,[L.6) = P(ea/A,[-a,8.a) that appears in OL. The 
right-hand side probability was defined to be the common value of the left-hand probability as the 
apparatus existents (and settings) not appearing on the right vary. Now if jt, figures in only one 
compatiblequadruple}i, then the variation of apparatus existents not appearing on the left can only 
lead to the consideration of probabilities conditional on incompatible existent combinations. 
(Explicitly, the above equation will imply that P(e4/A,p,0) = P(ea/A,t', 9) for all incompatible jL' 
containing }i,.) And since these probabilities have no physical content (and can be defined in any 
way one chooses without affecting our argument, as noted earlier), OL will inherit this lack of 
physicalcontent. 


There are two strategies one can use to deal with Shimony's doubt. First, one could grant 
that each apparatusexistentleaves svare trace on the others, but hold that there is no reason to think 
that such traces will be anything but small. So, for example, jt, will in general carry a small amount 
of information about the other existents [Lp cp - call this information p.4®:©.0, Since it is small, it 4 
plausible to suppose that it will be effectively causally irrelevant to the probability for any A- 
measurementresult,€,, i.e. P(ea/A,tLa.O4) = Plea, tla - paBOP,O4) Vita, where ‘La - 4BC.D is 
the A apparatus existent, less any smal! traces of information from the other existents. Given this, we 
can redefine each apparatus existent sothat it leaves out any causally irrelevant information about the 
other existents (i.e. redefine, as fla - [4BGD: Wp as jp - bp’, etc.); after all, this extra small 
amount of information should make no significant difference to the probabilities! This strategy 
restores the possibility that agiven existent canbe compatible with more than one combination of the 
other three existents, and so restores the physical content of OL. 


The other way is to follow Shimony's strategy in Ref. 33, and subsequent publications!*4), 
and drop TF in favour of the weaker assumptions: 


Outcomeladependence P(eik,8) = PreaA,6) P(egiA,8) P(ec/A,8) P(epA,O) 
ParameterIndepeadeace. P(E,g/A,8)=P(Ea/A,9 4) ; B(EBA,O)=P(ep/A,9 p) | 
P(Ec/A,8)=P(EciA,8.c) i P(ED/A,p,8)=P(EpA, 9p) 
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wherethe probabilities, P, are averaged over the |l existentsas: 
e.g. P(CR,@)=lyP(CA.q..8)0(UA, 8d P¢ewM.0)=In4[ DepovtyeasP(C%H.8)] PUA, dp, ete. 


The conjunction of the above two assumptions is equivalent to a jl-averaged version of TF, call it 
‘TE’. And it can easily be checked that TF is sufficient for our theorem in 3; just as it is for Bell's - 
the case Shimony is concerned with. So by using TF rather than the stronger TF, which in fact 
implies TF, given |'¥>'s strict correlations (cf. 3, Lemma 2), we can allow some scepticism about 
TF, in particular about OL's physical content, and yet still derive a contradiction with QM's 


predictions. Ne ») 

Weprefer the first strategy to Shimony's for three reasons, First, by using TF rather than 
TF in the next section, it becomes possible to show (cf. Lemma 2) that TF implies TF, given |Y>'s 
strict correlations - a result which sheds new light on the consequences of denying Shimony's TF 
(which has been surprisingly little discussed in the literature, nor even distinguished from TF). 
Second, as so often, there is a trade-off between using premisses as weak as possible for the task at 
hand, like Shimony's TF, and using premisses for which a simple physical motivation can be found, 
like TF, Being a trade-off it's a matter of judgement: ours is that TF is the best assumption to use 
because it is manifestly locality-motivated as soon as one adopts the first strategy above. Third, if the 
first strategy is rejected, itis not clear that denying Shimony's TF, which entails denial of TF (i.e. OL 
or OF), makes the realist's position undesirable. For as we've said, the realist can then be sceptical 
about the physical implications of denying OL. Since Shimony offers no other sufficient conditions 
for TF which the realist would be loathe to deny, a proof that shows TF to be in contradiction with 
QM''s predictions would have questionable import forthe realist. 


3. GENERALIZED ALGEBRAIC PROOF OF NONLOCALITY 


Having introduced some notation and motivated our assumptions, we now prove the main theorem: 
that given PL, and SCRI (the constraint of compatibility with |Y>'s strict correlations), my[(Arr)’] = 
1 (where V subsets S of A, S’ denotes the set theoretic complement of S). That is: the measure of 
particle existents which violate either of our ‘locality’ assumptions (Outcome Factorability or 
Outcome Locality), for at least one quadruple jt of apparatus existents (and so settings 8), and one 
quadruple € of measurement results consistent with the strict correlations (or anti-correlations) of our 
Gedankenexperimenfmust be 1. After proving this, we shall explain why GHZ's own proof no 
longer goes through in our more general measure-theoretic framework. 


We begin with alemma. Let I(A,6) be a shorthand for SCRI's apparatus existent integral (in 
square brackets in 2). Let @*, be countable subsets of O,, respectively. We need not commit 
ourselves to wérch countable subsets @*, represent until the final stage of the proof below. (In fact, 
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to do so at this stage would not only make our notation more cumbersome, but fail to bring out the 
fact, emphasized below, that many choices for @*, are sufficient for the final stage of our proof 
below - a fact which strengthens our proof.) Let A’gc = {AJI(A,8) = +1, VOEO*, and I(A,8) = -1, 
VOEO"}. So A%ge's superscript and subscript indicate that countably ('c') many of the strict 
correlations and anti-correlations (‘SC’) predicted by |'¥> are mirrored by eachA € A'gc. 


Lemmal: - PL a SCRI > my([(Are)'] = 1 - ma[AteNA’sc] 


Proof 1 - It is easy to show that SCRI implies YOE@,, maf{A]KA,0) = +1}] = 1 (for either upper or 
lower signs). Using the following theorem of probability theory: 


my [Sj] = my[S2] =... = my[Sq] =... =1 > mg [SyNS2N... 5M...) =1 
V countable sequences of sets Sq,52, ...5q.... (3.1) 


it follows that ma[A°sc] = 1. Again using probability theory: 
ma[(Are)"] = 1 - mafArp Asc] - ma[Ate(A®sc)'] (3.2) 
But since mg [(A%sc)" ] = 0, the last term on the right in (3.2) is 0. QED. 


For A within Arp \A%sc, let us define the (in general, stochastic) ‘result’ function 
AGA, pLa.8a) = P(+1/A,JL4.9,) - P(-1/A,[14,0,) which encodes the expectation value of A's spin for 
given existents (similarly for B-D). Theright-hand side is well-defined: recall that within ArpOA'sc, 


A-probabilities depend on}. only via ji, and so on 9 only via O,, etc. The next lemma shows that ] !) 
there must also be no [1 4-dependence: so we will be able to write A(A,0 ,), etc. [Siep ae 
v ‘ 
Lemmaz2: - LetAC Arp Asc. Then: ee , ¢ ab oil 
(1) VOE@,°U8*, A(A, 4,84) takes a common value A(A,8 4) VjLAEM; ro? | 
and similarly for B-D. kt ‘ 


(2) Y6E@,*, these common values obey A(A,6,)B(A,05)C(A,8c)DA,Op)=+1; and 
(3) V6EO *, they obey A(A,8,)B(A,85)CA,8c)DA,Op)=-1. 


(Note that there is no immediate contradiction between the equations in (2) and (3), since 
6,°NO"=@. However, in the main theorem to follow we show that there exists a choice of ©,° and 
@.°, infact many such choices, which imply a contradiction between the values assigned to one of the 
four terms in common between the two equations appearing in (2) and (3).) 


Proof 2 - First note that each AG Ap Asc, by the latter set's very definition, must satisfy: 
YOEO*,, 10,8) =f, AQ. HA.O) BA, He, OB)CA, pc, 8c)DA.Hp.Op)p(HA,8 dt = +1 (3.3) 


where for each 4, (3.3) must hold for both upper and lower signs (by Asc's definition). We 
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establishthe lemma via evaluating the integral I(A. 0) for both cases. But first note that for all settings 
in ©*,, therange of integration in I(A 6) (viz. over the set M) is the same. This does not mean that 
we have presupposed that every |LE M must be compatible with every 9€©*°,. For when we defined 
M, we allowed it to include apparatus existent quadruples which may be incompatible with some 
settings, or combinations thereof. And as earlier stressed, since these must have zero measure, they 
make no difference to the value of 1(A. 8). 


Consider now the upper sign case in (3.3), and a particular A. (3.3) clearly implies that 
VOc@*,, my(A,0)[M*] = 1; where the measure my(A,@)[S] is defined (as usual) as J gP(W/A.8)dpt Vv 
subsets S of M, and M+ = {t|ABCD=+1}. Now, in general, M* will be a subset of M. But since 
(Mt)’ has measure-zero, we can define probabilities conditional on {L's outside M*, but inside M, 
any way we like without affecting the value of I(A,8). So we can choose to define these so that 
they also satisfy ABCD=+1. So far, then, we have VOEO*,, VWPEM: ABCD=+1. 


Now we proceed by ‘separation of variables’. Fixing jlp.c,p in M fixes the value of A in 
this set independently of its argument la, since AfA,a0a) = 


~~ +1/[B(Q, t,p)CQ.,j1c.0c)DQ.jLp,Op)] throughout M. However, all jt, in M must be compatible 


with each fixed triple of existents |Lp, [tc, and [Lp in M, because the latter is a Cartesian product. 
(Recall from 2, that we were free to choose M to be such a product without making any difference to 
the value of I(A,6).) Thus A(A,}14,0.4) must be a constant throughout the set M - a constant we 
denote by A(A,0.4). Continuing with the same reasoning yields that B, C, and D must be constants 
B, C, and D, Vit, fc, atid pEM, respectively. Because of this constancy, we can take I(A,6)'s 


integrand outside ofthe integral, yielding: 
VAEATP UA gc, VOEO*,, AA,Oa)BA,Op)CA6c)DA Op) =+1 (3.4) 


because our initial choice of AC AtpNA*g¢ was arbitrary, and my(A,0)[M] = 1 (by normalization). 
Using a similar argument, one can start from (3.3) with lower signs and show that: 


VAG ATP sc, VOEO*, A(A,0,)B(A,6p)C(A,0c)DA,Op)=-1 (3.5) QED. 


Note that because of Lemma 2, not only have we obtained independence from apparatus 
existents, but throughout the set ArpOA%gc the stochastic framework we began with has collapsed — 
into a deterministic one, for countably many strictly correlated and anti-correlated settings. For the 
fact that each of A through D liesin the interval [-1,+1], coupled with Lemma 2's constraint thattheir 
product equal +1, implies that escé of A through D must equal +1 individually. It is also easily Aw) 
checked that Lemma 2's conclusion implies Shimony's TF, thus establishing that if TF and |‘¥>'s 
strict correlations hold in any A-state, that state must also satisfy TF - as claimed at the end of section 
2. 


Using the above lemmas, we can now prove the main theorem: 
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Theorent - PL «a SCRI > m[(Aqe)'J] =1 


Proof: - We need only show that Arp A%sc is the empty set; for then my[Arp A‘ gc] = 0, and by 
Lemma 1, the theorem follows. To show Arp A'sc is empty, we show that there exists a choice of 
sets O°, and O° such that there cannot existaA € App Asc satisfying both equations in Lemma 
2. Choose: 


Of%,= {<84,05,9¢,6p>,<04-1/2,05+0/2,0¢,8p>, 
<O4,05+1/2,0¢+1/2,8p>,<O4+N/2,0p,0¢+1/2,8p>} 
O% = {<O4-9/2,05,0¢+1/2,0p>} (3.6) 


where <0,4,85,0¢,0p> is an arbitrary quadruple in ©,. (The latter guarantees that O°, and ©*. are 
indeed subsets of ©, and ©., respectively, as required by their definition.) With these choices, 
Lemma2impliesthateachA € App A'gc must satisfy: 


A(A,04)BA,0p)C(A.8¢)D(A,Op) = +1 (3.7a) 
A(A,9 4-11/2)B(A,O+1/2)C(A,0¢)D(A,Op) = +1 (3.76) 
A(A,04)BQ.,8 ptN/2)C(A, 8 c+0/2)D(A,0p) = +1 (3.7c) 
A(A,0 4+1/2)B(A,0p)C(A,Oc+/2)D(A,Op)= +1 (3.74) 
A(A,0 4-91/2)B(A,0 )CA,8c+N1/2)D(A,0 p)= -1 (3.7e) 


By simple algebra, eqns. (3.7a-d) together imply: 

A(A,0,t0/2) = A(A,0 4-1/2) (3.8a) 
while eqns. (3.7d,e)imply: 

A(A,6 4+1/2)= - A(A,0 4-1/2) (3. 8b) 
Since all result functions must be deterministic (+1), eqns. (3.8a,b) are contradictory. QED. 


The above contradiction can be derived for many other choices of the sets @,° and 0°. 
Some of these correspond to: (a) the uncountably many different ways to choose the setting 
quadruple (0.4,95,0¢,8p) defining the sets chosen in (3.6); (b) the arbitrariness in our chaice to hold 
8p constant throughout O*,\ 0%. rather than one of the otherthree settings; (c) the arbitrariness in our 
choice to derivethe contradiction in terms of the A result function rather than one of the other three 
result functions; and (d) the arbitrariness in the quadruple <84,65,9¢,8p> being taken to be an 
elementof9,, rather than 6. (choosing ©_ interchanges our choices for the sets 0,° and © while 
preserving the contradiction: the + 1's in eqns. (3.7) become -1's and vice versa). 


These various ways to choose ©,° and © * compatible with achieving acontradictionrule out 
the possibility that there may be contradictions only overa set of setting quadruples of measure-zero. 


New) 
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And so they help to make our theorem stronger than it stands at present. For we could have initially 
assumed that TF holds V8c@*, 6% ratherthan VOEO ,UO.; and similarly weakened the universal 
quantifier in PL's definition (cf. 2), With these modifications, the proof goes through analogously, 
and establishes that, with probability 1, at least one of Outcome Factorability or Outcome Locality 
must fail for at least one possible quadruple of apparatus existents, and measurement results; and for 
uncountably many setting quadruples satisfying 04 + Op - Qc - 8p =+H or O (not just for atleast one 
such quadruple). 


~ 


We shall now explain why GHZ's original proof is no longer valid in our measure-theoretic 
framework. As mentioned in 1, GHZ start with the existence of deterministic result functions whose 
arguments encode the initial particles' existent, Ag, back at the first decay. These functions are 
assumed independent of apparatus existents but satisfy OL; OF is trivially satisfied because it is 
implied by determinism. Invoking |’>'sstrict correlations, GHZ claim that: 


FAgE Ag (=the set of all possible initial particle existents) suchthat: 
A(Ag,84)B(A0,95)C(A0,8c)DAp.8p)=+1[-1] VO such that O4+0p-O¢-Op= [0] (3.9) 


without restricting this assertion to a countable number of setting quadruples which satisfy the right- 
hand equation, as in our Lemma 2. Indeed wyéé such a restriction, their suggestion for a proof of 
contradiction cannot be carried through. For they say '...we can continvously vary two of the 
parameters {i.¢. settings] while keeping the other two constant, This leads to the conclusion that 
A=B=C=D=constant. But this is impossible, since the product [i.e. on the left above] sometimes 
equals +1 and sometimes equals -1' (italics ours). 


True, using a proof analogous to the final stage of ours (but slightly more involved) one can 
readily derive from (3.9) the conclusion that A(Ag,04) = A(Ao,04+6) for arbitrary 5€[0,2%) (and 
similarly for B-D) using continuously many strictly correlated setting quadruples (or continuously 
many strictly anticorrelated quadruples). And this implies that 
A(Ag,8,)B(Ap 6 p)CA0,8c)DA0,8p) = constant! for all possible setting quadruples, in 
contradiction with (3.9)'s assertion that for some quadruples strict correlation must be predicted by 
Ao, and for others strict anti-correlationsmustbe predicted. 


However, this proof turns on wrongly assuming that (3.9) follows from |¥>'s strict 
correlations without a ‘countably many’ restriction on settings satisfying the equations §4+6p-8c- 


10 We found no ‘way to prove from (3.9) that all these result functions must be the nuze constant, as GHZ assert. In 
any case, a contradiction follows from the fact thet each is cadsravwuliy 0 constant. Since writing this paper, we have 
checked that all the details of ow interpretation of GHZ's suggested proof mre correct by referring to Professor 
Greenberger’s unpublished lectwra notes (of. eqns. (18}-(28) in Section VIL: ‘Going Beyond Bell's Theorem—1. Efficient 
Govaters'). 
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Op= N[0]. In the more specific class of theories envisaged by GHZ, these strict correlations must be 
recovered, givenPL, bythe integration: 


| 4,A(0,84)B(A0,85)C(Ao,8c)D(Ao,Op)p(Ao)dAo=+1[-1],'¥8 such that 8.4+8p-8c-Op=M(0] (3.10) 


which is a special case of our SCRI. We shall simply give one of many counterexamples where 
(3. 10) is true but (3.9), without restriction, is false. 


Take Ap to be a continuous set of existents; and suppose that (for either upper or lower 


signs): 
VOE@ u, (AdlA(Ag,84)BQ0,95)C(Ap,8c)D(A0,Op) = +1} = Ao - {Ao(8)} 3.11) 


where A(@) is a single element of Ay - wArch element depending upon 9!!, Then since VOEO,, 
mmigLAo - (Ro(8)}] = map fAa] = 1: 


VOEO x, may {{Ao/A(A0.84)B(Ag,85)CAp,8c)DAo.Op)=+1}] = (3.12) 


which makes (3.10) true. Suppose also that as 0 varies throughout ©,, Ao(@) varies throughout Ap. 
Then: 


{AglA(A0.84)BQ0,88)CA,8c)D(Ap,8p) = +1, VOEO,} = 
eco, (AolA(Ao,04)B(A0,85)CA0.0c)D(Ag,Op)=+1} = (3.13) 


Vag(0)eAq [Ao - {20(8)}] = 2 
thus (3.9) is false. 


In short, the final stage of our proof is more than just a simplification of GHZ’s to a finite 
number of settings. This finiteness saves their proof from measure-theoreticinvalidity. However, to 
be fair to GHZ, they do not use measures; since they launch their argument from EPR's premisses 
which equate measure-one with certainty (specifically, EPR's sufficient condition for identifying | 
elements of reality). Aad it is easy to see that equating the two circumvents the trouble we have N ew) ’ 
identified above. However, we regard it as interesting that without equating the two (which is, in any 
case, illegitimate), their proof needs to be modified in a nontrivial way to establish their result. V 


Since we pointed outthe above measure-theoreticinvalidity in GHZ's proof, Greenberger er 


I} This des pot Mean thet ow counterexample somehow violates locality by moking particle existents depend on : 
settings, The notation ‘Ag(@) is merely meant to encode the fact that, in general, for different settings, different New 
measwe-one sets of Ag will predict strict correlations. This follows from the result functions depending on 0 as well as 

Ap, so that different settings may lead to different predictions dictated by the same Ap-state. 


Citog, Andina sof Sothwtelt 19 Goawalinal Abwirusr Mahwah Foo 


2/.(5) have proposed an alternate proof of contradiction in order to establish their result. We would 
like to explain how the measure-theoretic trouble that occurs in GHZ's original proof is shared by this 
alternate proof. We are motivated ta do this, partly because these authors explicitly claim that their 
proof goes through without equating measure-one with certainty, and partly by the fact that it allows 
us to explain the trouble in a different, perhaps more transparent, way. 


The main difference between GHZ andGreenberger et a/.() is that the latter (like us) aim to 
prove acontradiction directlyin terms of one of the result functions, A, rather than all four. Using 
our notation, the main part of their proof starts from the assertion (cf. their eqn. (11a)): 


If 64+05-0¢-Op= 0, then A(X,04)B(A,8p)C(A,8c)DQ.8p) = -1 (3.14) 


They then choose four appropriate quadruples of settings satisfying (3.14)'s antecedent, and deduce 
from itthat: 


A(A,20) = A(A,0), for agiven angle > (3.15) 


But since the angle ¢ which figured in each of their setting quadruples can be chosen arbitrarily, they 
conclude (cf. their eqn. (16)): 


A(A,20) = A(A,0) = constant, Ve [0,2m) (3.16) 
(3. 16)then contradicts the equation: 
A(A,0 + %) = -A(A, 0) for agiven angle (3.17) 


derived from another appropriately chosen setting quadruple satisfying the strictly correlatedanalogue 
of (3.14) (i.e. with treplacing 0, and +1 replacing -1). 


We do not question the deduction of (3.16) per se - only Greenberger era/'s claim (cf. Ref. 
5, note 14) that it continues to go through if measure-one is not equated with certainty. For suppose 
we do not make this equation. Then all that one can infer from the strict anti-correlations |'Y> predicts 
is that the set of A's for which (3.14)'s consequent is true has measure-one. And so, instead of 


(3.15), one can only derive: 
For a given angle g: ma[{A|A(A,26) = A(A,0)}] = 1 (3.18) 
of, recognizing that the choice of was arbitrary: 
VoE[0,2m): ma[{A|AA,2) = A(A,0)}] = 1 (3.19) 


Our pointis that one cannot derive (3.16), which (more explicitly) requires: 
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=A. such that Ve (0,27): A(A,2) = A(A,0) = constant (3.20) 


from (3.19). The reason is as before: For an uncountable infinity of 's (such as all those in the 
range [0,20)) itis perfectly possible to have: 


 Yuacounabte sot of ¢ {AJA(A.26) = AQ,0)} =! (3.21) 


which would illustrate the falsity of (3.20)~and a/ortrorz of (3.16). 


However, by choosing acountable, orindeed finite, set of , this problem is easily avoided. 
Because if (3. 19) is true: 


MAL Veountadio oo of » (AAQA,24) = A(A,0)}] = 1 (3.22) 


So there are plenty of values of A for which A(A,2) = constant for any couatab/eset of o's. In fact 
we can use this to rescue Greenberger e¢a/ 's proof. For to get the contradiction they desire between 
(3.16) and (3.17), they do not need to use the full strength of the universal quantifier over 
uncountably many q's in (3.16). In their derivation of (3.16) they could just specialize to @ = 1/2, 
while in deriving (3.17) they could specializeto ¢ = 0, yielding the contradiction: A(A,0) = -A(A,0). 
And since the choice of 0 is arbitrary, it is still possible to generate uncountably many such 
contradictions by running through the argument uncountably many times! But of course this then 
leads back to proving the contradiction in essentially the way that we ourselves have (cf. 
contradiction between (3.8a) and (3.86), and the discussion about strengthening our theorem th 
follows). 


To sum up, we regard our own proof, discovered independently of GHZ's and Greenberg 
etal's, a8 a nontrivial improvement on theirs. For it is surely logically possible that a m 
theoretically valid proof could not have been given - whereas our proof shows that the piys 
permits one to rule out this possibility. 


4. COMPARISON WITH OTHER ALGEBRAIC PROOFS AND BELL'S PROOF 


We shall first briefly compare section 3's proof to other algebraic proofs of nonlocality, before going 
on to compare it in more detail to Bell's statistical proofs. Our discussion will presuppose some 
acquaintence with the Bell-Kochen-Specker and Bell theorems (as discussed, for example, in Refs. 
1, 11, 35 and 36). 


The first published algebraic nonlocality proof was given by Heywood and Redhead (cf. 
Ref. 11, Ch. 6; and Refs. 37, 38), They realized that although the extension of Maczynski'stheorem 
allows one to assign elements of reality to a composite quantum system as functions of locally 
maximalobservables39.40), one could nevertheless obtain a finite Bell-Kochen-Specker ‘colouring’ 
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contradiction for these systems by imposing, in addition to purely algebraic constraints on the 
elements ofreality!2, a Bell-type locality condition. A simpter, more general proof was given by 
Stairf*2), and more recently Brown and Svetlichnyl4?), They do not require ay independent 
algebraic constraints on the elements of reality: they rely solely on a locality constraint and yet still 
arriveatacolouring contradiction. 


Elby4) has recently generalized their proof so that it applies to stochastic realist theories in 
which apparatus existents are causally relevant. Elby considers measurements undertaken on two 
correlated particles inthe spin-{ singlet state, and exploits the strict correlations which obtainbetween 
the spin Hamiitonian of one particle and spin of the other to deduce a contradiction with TF and PL. 
As in our proof, locality assumptions together with the strict correlations force the realist theories at 
issue to 'EPR-collapse’ to deterministic theories independent of apparatus existents (although the 
arguments we use to prove this collapse are somewhat different). Also, in both proofs the crucial 
final contradiction does not require any independent algebraic constraints. So both our proof and 


Elby's improve on previous proofs. 


Further, Elby (like us) proves a contradiction between elements of reality (whose existence 
he denve} that pertain to just oze of his two particles. However, Elby's proof still relies on the 
somewhat technical Bell-Kochen-Specker ‘colouring’ theorem which requires roughly 100 carefully 
chosen observables for which elements of reality exist, whereas our proof requires only 3 A-spin 
observables. 


So much for how our proof improves upon other algebraic proofs. We now tum to 
discussing how it also improves on Bell's statistical proof. But first, it is important to be clear on 
how it doesn't improve on Bell's! We are motivated to do so in light of GHZ's claim that ‘from the 
point of view of reality’, their proof's restriction to the 'superclassical’ (i.e. strict correlation) case 
makes it more interesting than Bell's. We disagree: our reason being that the same Bohrian response 
can be made to both proofs. ‘ 


Both our proof and GHZ's (like Bell's) consider incompatible measurement contexts. In 
particular, the final contradictions are derived from statements about elements of reality (encoded by 


12 Specifically, Heywood and Redbead assumed ‘FUNC™’, which in part requires that the elements of reality assigned to 
maximal observables mirror the Functional relationships between the corresponding operators, but whose physical 
motivetion wes not fully clear, Elby(4!) has recently derived FUNC” from other physically motivated assumptions. It 
is interesting to note that ow eqn. (3.8)) is a consequence of FUNC™, and tha ow derivution of (3.8b} from locality 
and |'¥>‘s strict correlations corresponds in effect to a simplified version of Elby's derivation for that particular instance 
of FUNC™. This helps one see why ow proof does not need to independently assume any algebraic constraints on 
elements of reality - for say constraints that we needed, ove derived (com plete with the elements of reality, them salves!) 
from locality and |'¥ >'s strict correlations. 
(Aes 


Nun 
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the various result functions) that cannot all be displayed in a single measurement context. But we 
know from Bohr's original response to EPR(5) that he would reject any step ina proof which passes 
from one context to another incompatible context while retaining the definiteness of all elements of 
reality atissue. But just which step is this? 


Consider first our proof. Since Bohr regarded QM as providing a complete description of 
physical ceality, wecan only legitimately talk of his response to a version of our proof that takes the 
QM state-vector to be (or, more weakly, to determine) an exhaustive specification of the particles’ 
existent,A. In that case (as mentioned in 2) OF holds if and only if this state vector is a separable 
state. But|'Y> is manifestly nonseparable, Bohr no doubt would have denied OF. (And since Bohr 
writes that ‘any direct mechanical interaction of the system and the measuring agencies is 
excluded{“6), thus ruling out manifestly causal nonlocal connections, he presumably would endorse 
OL?8).) More specifically, our contradiction arises from assuming that OF holds for all contexts 
regardless of whether or not they are compatible. Soin the case where A = |‘¥> (the only case Bohr 
would recognize as legitimate), we effectively assume the particles’ joint state divides up into parts 
once and for all, ina context-independent way. This is opposed to Bohr's response to EPR in which 
he effectively stresses that the particles’ state ‘splits' up into parts in a way that depends on the 
measurement context. (Put in terms of wavepacket reduction - the basis in which reduction occurs 
for the joint 4 particle system depends on which measurements are performed.) 


As for GHZ's own proof, they preface it with EPR's original (jess formal) argument and so 
inherit Bohr's objection to the ambiguity in their phrase ‘disturbance’ - which could mean either the 
‘splitting’ induced by the measurement context discussed above, which Bohr accepts, or a more 
obviously ‘causal’ violation of OL, which he rejects. Admittedly, it is tempting to think that Bohr's 
position is more tenuous because all the measurement contexts GHZ consider admit strict 
correlations, and so elements of reality. However, Bohr would hold that once a certain context is 
realized and elements ofrealityattributedin that context, this blocks attributing elements of reality to 
otherincompatible contexts which aren'tinfactrealized!3, 


Now to improvements on Bell's theorem. Famously, it requires only two of the four 
particles in our Gedaakenerperimenswhile our proof required at least three!4. But there is a link. 


plod 


13 gince writing this paper, Greenberger et auf (Ref. 5, note 10) have themselves stressed that Bohr would reject their Ae ) 


argument, in complete agreement with what we've said here. 

14 Fadeed, since both ow theorem and Bell's can be applied without wsing all particles, ow proofs remain valid using 
weekened locality assumptions. For example, OF can be weakened so thet oaly one (in Bell's case) or two (im owrs) of 
the fow outcomes is ‘factored’ out of OF's conditional joint probability for the fow owcomes. Simily weakeniags are 
possible for OL sad PL, but we shall not pursue this matter here in order to keep things simple. 
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Suppose this Geduakenexperimeat is performed with 8p p constant throughout so that the 
correlation coefficient predicted by |'¥> reduces to -cos(94 - 8c + A), where A (= Op - Op) isa 
constant: call this correlation CCjg,(64,0c,A). It is well-known that there exist araqy quadruples 
04,94',8c,9c' (no pair of which need admit strict correlations) such that”): 


ICCyy, (8,,8¢,0)+CQy, (84,9¢' , 6)+CQy, (8, , 8c,0)-CQg, (8,' » Qc 9)| 
=2+ £04,84'.8¢,8c',9) >2 (4.1) 


thus allowing Bell's theorem to be applied to the A and C particles. 


In its most general form (Ref. 1, Ch. 7) this theorem begins by adopting OF, OL, and PL, 
VAEA,V04,04',8¢,8c' EI (where F = {04,04',0c,0c'lf > 0}). In addition, Bell makes two 
furtherassumptions, motivated by locality, concerning the apparatus existent distributions: 


VpA 84.84" ,8c,0¢' Er -- 
Apparatus Factorabilig( AF): p(A,8)=p(aA.8 p(ipA,8)p(icA,8 )p(ip’,8) 
Apparatus Locality (AL). p(}a/A,8)=p(ta,6 4):P(lpA,8)=p(ip/A 8p); 
P(A, )=p(tc/A,8c);P(tA,8)=p(LpA,8 p) 


Weshall call he conjunction of these assumptions ‘Total Apparatus Factorability’ (TAF). 


In precise analogy to OF and OL, the probability densities p(j14/A,9), etc. appearing in AF 
and AL are defined in the usual way as marginals obtained from the distribution p(/A,6); and the 
distributions on the right-hand side of AL's equations are defined as the common value of the left- 
hand sides as the settings not appearing on the right vary. Further, as discussed in 2, TAF is most 
plausible when its distributions are taken to be already averaged over all other existents in the 
spacetime regions to the past of the four measurement interactions. 


With these assumptions, the integrand in SCRI's integral (which gives the existent 
expression for the four spin correlation prior to A-averaging) reduces to 
AQA.0,)BQ,05)CA,8c)DA,Op), where AMG) = hy AAHASa)PtaA.Padta, etc. (Note 
that since, without loss of generality, M can be regarded as the Cartesian product MaXMpXMcXMp, <— N ‘enn 
Bell can indeed factor SCRI's integral into four integrals of thistype.) Using the fact that |AJ, IB}, ICI, 
and |D| < | for ali values of their arguments, Bell's inequality can thus be derived: 


V64,04',8c,0c' El, |AA,04)B(A,On)CA,8c)DA.0n)+AR.0 4)BA.05)CA,6c')D(A,Op) 
+A(A,84)BQ,0p)CA,8c)DA.0p) - AA.04')BA,On)CA.0c')DA,8p)| 2 (4.2) 


Multiplying (4.2) by p(A) and integrated over A constrains the (actually measured) A-averaged 
correlations in contradiction with (4. 1). 
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One difference between Bell's theorem and ours is that his yields a statistical 
contradiction, whereas ours leads to an algebraic one. So in one respect, our conclusion is stronger. 
For the strongest conclusion that can be deduced from the contradiction between (4.1) and (4.2) is 
that at least one of Bell's locality assumptions must be violated for a measure of A-states of at/east 
f/2. Since the maximum value f can take is 2(/2 - 148), this measure’s highest value is just over 
2/5: much less than the measure 1 set of A-states required by our theorem to violate our locality 
assumptions! 5. So ourtheorem goes beyond Bell's by proving that realist theories able to reproduce 
all QM's predictions must violate locality assumptions (given appropriately chosen settings) in 2// 
but a measure zero of trials; as in the de Broglie-Bohm theory(@!] where the quantum potential is 
never ‘turned off’ from one trial to the next. (By contrast, one can easily show that the simple model 
devised by Bell (Ref. 1, pp. 16 - 17) violates OL for no more than the minimum measure of A-states, 
f/2, required by Bell's theorem. Thus our theorem shows it is incapable of reproducing the full range 
of |'?>'s4 particlecorrelations without significant modification. ) 


Asecond difference isthat our proof does not require the extra assumptions AF and AL, 
since we derive the independence of outcomes from apparatus existents via our other assumptions 
without imposing any conditions on their distributions (cf. 3, Lemma2). Physically, this means that 
our theorem rules out a class of apparatus nonfactorable [and apparatus nonlocal] theories which, 
while satisfying OL so that each A-result depends only upon the jt a-existent (and A and 64) in the 
immediate causal history of this result, violate AF [or AL] so that the probability of ji,'s occurrence 
depends upon (say) the distant C-apparatus existent [or setting]. 


True, the motivation for assuming AF and AL is the same as that for assuming OL. But as 
always, it is worth weakening the locality assumptions used in these proofs, so as to get stronger 
conclusions about what kind of nonlocality they require. And although there are no clearcut examples 
in the literature of TAF-violating theories where OL holds, such theories are far from vacuous. In 
particular, ajta-fic[-8c] stochastic dependence of the kind prohibitedby AF [AL] is compatible with 
the assumption (contained in OL) that the A-result is tochastically independent of pc[and Oc] far 
fixed 4. For one must distinguish between: pc[or 6¢] and jl, sadepeadeatly causally influencing 
the A-result statistics (i.e. two distinct causes of a common effect) which OL prohibits; and |tcfor 9c] 
causallyinfluencing thesestatistics r7# [1,4 (i.e. atwo-step causal chain) which OL allows! ©. 


15 & may de possible to improve on Bell's measwe by considering higher-order Belltype ‘facet’ inequalities(49), or 
‘chained’ Bell inequalities (50), 


16 Ruling ow the Eirst type of dependence while allowing the second is anslogows to imposing the restriction on the 
function F(x{y),y) that & equal F(x{y]) Vy, while contiaving to allow x itself to depend upon y. Indeed, classical 
deterministic analogues of these two types of (causal) dependence abowad. Recall that: (I) the force on a swfaee from 
which two (masskess) Hookean springs with spring consteats kj ani ky are suspended in parallel is F(x,y) = kyx + 
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Itis of course true that Bell could have avoided TAF by reinstating in his proof enough strict 
correlations to permit a derivation much like ours of the independence of outcomes from apparatus 
existents. (Indeed, Bell's informal EPR argument for determinism in his early proof (Ref. 1, Ch. 2) 
comes close to doing this, although he makes no expli itreferenc to apparatus existents, nor to their 
distributions.) On the other hand, Bell would then be sacrifi 3 one of the main merits of his later 
proof - viz. its not requiring experimentally unrealizable strict correlations. We show in the next 
section that our proof shares this merit once TAF is added to its assumptions. * 


5. THE NONSTRICTLY CORRELATED VERSION OF THE PROOF AND A NEW 
TESTABLE 'QUASI-ALGEBRAIC' LOCALITY INEQUALITY 


The following is a nonstrictly correlated version of section 3's algebraic nonlocality proof which 
applies to roughly the same class of theories as does Bell's nonstrictly correlated proof, and also 


leadstoatestablelocalityinequality. 


For the theories at issue we must weaken section 2's SCRI to: 


NogStrictCorretationRecoverylategral (NSCRI): 
IAL (SelTeP(eA.w.0))oqun.0 yd ]Jo@veysA = Cc(@) ve 


where we can either: 


(1) take CC(8) to be the experimentally observed correlation fora ® for which QM predicts 
strict correlations (the difference due to inefficiencies of spin-meters - cf. Ref. 1, pp. 86-7; 
and 46, p. 1889); or 

(II) take CC(8) to be a OM-predicted correlationfor a © for which QM does not predictstrict 
correlations (either in the case considered up to now, where the particles travel in precisely 
opposite directions along the z-axis; or in the more practical case where they don't, in which 
case QM does not predict strict correlations in state |"¥> anyway). 


In either case CC(8)¢ {0,1}. Now we define: 
Ansc = {AJI(A.,6) > 0, V8EO4 and 10,05) < 0} 
where, as in3, I(A,0)is a shorthand for NSCRI's apparatus existent integral; and: 


04={<64,0p,9¢,0p>,<0,', Op’, O¢,8p>,<84,05',8c', Op>,<O4'+7,0p,0¢',Op>} = {61 ,02,03,04} 
85 = <6,4',0p,0¢',8p> 


kyy, where x and y ore their exteasions; while (I) the force with the same two springs suspended in series is F(x{y)) = 
kyx, where x= (kylky}y. 
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where 6,4,0,4',95,95'.9¢,9¢', and Op are arbitrary settings. (Asin 3, there are many ways to 
structure one's choice of settings compatible with a valid proof of contradiction; but, for simplicity, 
we shall not attemptto account for them all. ) 


We present the proof in two stages, beginning with a lemma: 
Lemma -PL A NSCRI = mfAngc] 2 CC(O1) + CC(@2) +CC(83) +CC(84) - CC(Os) - 4 


Proof: - Define A,o(8) = {AJE(A,0) > 0} and, similarly, A.o(0) = {AJI(A,6) < 0}. By NSCRI and 
PL: 


VO, COB) = [A1A.89p RIAA =|, coyH.eyp yer + {4 pret. OPM 
> OmiAo(@)] + -LmlAa()] 6.1) 
and <  tmylAg(®)] + OmfA0() 


Thus ma[A,o(8)] > CC(@) and ma[A.(8)] > - CC(6), VO. In particular, ma[A,o(8)] > CC(6) 
VOE@4, and mf A.o(85)] > - CC(O5). Using this, the lemma follows by applying (for n=5) the 
following theorem of probability theory (easily proved by induction): 


If ma [Si] > q1, mg [So] > qo, ...,.ma[Sg] 2 qa,... where Sy,S2, ...Sp, ... 
is a countable sequence of sets, then ma[ Sa] 2 (Zn4qa) - (n- 1) QED. 


The idea of the theorem is to put a lower limit on the measure of particle existents which must 
violate either Total Factorability, or Total Apparatus Factorability, for at least one of the settings 
81 ,62,03,04, or O5, and at least one quadruple of apparatus existents and measurement results. So 
wedefine: 

Arratar= {A|VPe M,V6EO4U {5}, Ve: TE and TAF hold} 


in order to put a lower limit on this set's complement. (Clearly the theorem is only significant as a 
proof of nonlocality if this lower limit is positive - more on this below.) 


Theorem -PL a NSCRI => my[(Arp,tag)’} = CC(81) + CC(82) +CC(63) +CC(84) - CC(Bs) - 4 


Proof: - It suffices to show that App, Tar Ansc must be the empty set. For then we have three results 
which togetherimply the theorem: (1) ma [(Ate,tag)’] 2 ma[Ansc] - ma{AreatarAnsc] (since for 
any sets Sy and Sz, mf[Sy"}=1-ma[Sy]=ma[SyUSe)-oy [Sy ]=my[S2}-m[S1Sz]); (2) 
ma[Ate, TAR Ansc] = 0; and (3) the Lemma result. 


By the very definition of the set Atp,tanVAnsc, each A in it satisfies 1(A,8) = 
AQA,O,)BA.Op)CA.8 DA. p)vVOEO4U{ 05}, where AtA,8 4) = u,AQtta.Padptal.Ba)dta, 
and similarly for the other three jt-averaged result functions. Furthermore, eachA € Arp, TAR VANsc 
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must satisfy: 
A(QA,84)BA,05)CA,8c)D(A,Op) = ki > 0 (5.2a) 
A(A,04')BA,0p')CA,8c)DA,8p) = ke > 0 (5.2b) 
A(A,84)BA.,8p')C(A,8c')DA,8p) = kg > 0 (5.2c) 
AQ, 8 4'+1)B(A.,8p)CA,8c') DAO p) = ka > 0 (5.2) 
2 —" A(Q,94')BA,83)CQ,8c')DA,Op) = ks < 0 (5.2e) 
Ppa. By simple aigebra, eqns. (5.2a-e)togetherimply: 
ar [A(A.6a)P = kike[A(A.0 4')]?#kaks (6.3) 


Sine by eqns. (5.2a-e), each {l-averaged result function must be nonzero and kyk9/kaks must be 
negative, (5.3) is a contradiction - the left-hand side must be positive while the right-hand side 
negative. QED. 


The final stage of this proof is analogous to the final stage of section 3's strictly correlated 
proof. It differs mainly in our use of Bell's strategy (cf. 4) of first averaging over apparatus 
existents, using TAF. Also, in contrast to section 3, there is no collapse to determinism, and the 
contradiction arises not between Einsteinian elements of reality, but between probabilities for 
measurement results. For this reason, the proof is best referred to as a ‘quasi-algebraic’ proof. But 
note that the proof reduces to a slightly weaker version of our strictly correlated proof if one takes 
QM topredict strict correlations for the four quadruples in *, and strict anti-correlation for O5. In 
that case, CC(8,) = CC(A2) = CC(83) = CC(84) = +1 and CC(@s) = -1, so that the conclusion 
becomes m,[(Arp, tar) ] = 1: differing only from our strictly correlated proof in the inclusion of 
TAF as one of the locality principles whose conjunction must be false with probability 1. 


We now discuss the significance of this nonstrictly correlated proof. Consider first case (1), 
where CC(8) is the experimentally observed correlation when 8 should, according to QM, display 
strict correlation oranti-correlation. Clearlythe above theorem necessitates a renunciation of locality 
for anonzero-measure of particle existents only if the following inequality is observed: 


Quasi-AlgebraicLocalitylnequality.  CC(84) + CC(B2) +CC(83) +CC(G,) + (- CC(Bs)) > 4 


A necessary condition for this inequality to hold is that at least one of the terms summed on the left- 
hand side must empirically be shown to be > 0.8. So if, asin the previous paragraph, 6, through 04 
permit QM strict correlations and 65 anti-correlation - sothat thefirst four terms above would, under 
ideal experimental conditions, be expected to be +1, and the last -1 - ourtheorem allows for at least a 
20% deviation from these theoretical predictions dueto nonideal conditions! 7. 


17 a general, ow proof allows for at Inast a [100/N]9 deviation from theoretical strict sorelations, where N is the 
nun ber of setting quadruples used to derive the contradiction. R is thus advantageous thet N can be mode low: e.g. in 
the text, N=5. For with other algebraie proofs relying on the Bell-Kochen-pecker contradiction, N ~ 100 (26,51-54) _ 
We conjectwa that this will seriously impede their extension to the nonstrictly correlated case from the experimental 
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Consider now case (II) where each E(0) is a QM-predicted nonstrict correlation. Clearly 
many choices of 0; through Qs satisfy the inequality, but for simplicity let us choose: 


64+65-0¢-8p = 64'+9p'-8¢-8p = 0 4+95'-8c'-98p = (O4't1)+9p-0c'-8p (5.4) 


so that the theorem then says ma{(Arp, Tar)’ ] 2 - 5cos(04+9p-9c-9p) - 4. An interesting question 
arises: Does our theorem necessitate a measure of A-states which must violate either TF or TAF thatis _ 
lower than Bell's? 


As mentioned in 4, the strongest conclusion the Bell theorem can reach is ma[(Aqp, Tar)’ | 2 
V2 - 1; whereas this lower limit is improved upon by ourtheorem (with the general choice of settings 
above) fora whole range of settings - specifically when 208° > 04+0p-Oc-8p > 151°. On the other 
hand, if both steength of conclusion (i.e. Aigé measure of states violating locality) and the difficulty 
of measuring tight correlations are considered together, Bell's theorem fares better. For the 
magnitude of the measured correlation Bell requires to establish the lower bound V2 - 1 is {-cos(m/4)| 
*: 0.71; whereas for our proof to zmprove on this lower bound one would need to measure a 
correlationmagnitudeof |-cos(04+0p-Oc-Op)| > (V2 + 3/5 » 0.88, 24% greater than 0.71. In any 
case, if the main experimental interest is simply to establish that the measure of locality-violating 
states must be nonzero, our theorem compares more favourably to Bell's in only requiring the 
capability of measuring 20.80 correlation. 


It remains to be seen how these theorems compare with respect to other practical difficulties: 
e.g. the need for assumptions like Clauser and Horne's™%) 'no-enhancement', and whether such 
assumptions are more plausible than their Bell theorem counterparts. Such issues would interest 
those who continue to be sceptical about the empirical warrant for nonlocality based on recent Bell 
experiment). For our part, although our nonstrictly correlated proof is of practical interest, at a 
philosophical level we concur with GHZ who say: ‘Our general attitude is that we assume that 
quantum mechanics gives correct answers, and the question is whether a classical model can 
reproduce these answers’ (p. 71). We agree fortwo reasons. First, prior to experimental tests of the 
singlet state's correlations, there was a clear need to develop a nonstrictly correlated version of Bell's 
theorem so as to adapt it to experiment. But since these experiments have overwhelmingly confirmed 
the singlet state's nonstrict correlations, they already provide stcong inductive support for its surct 


poim of view. Fwthermore, these other proofs fese the extra practical difficulty of measwing the spin 
Hamikonisn 5). (Since writing this paper, ow conjecture has been confirmed by Elby©) who gives an elegant 
noastrict correlations version of his Bell-Kochea-Specker-type algebraic proof (cf. 4). However, Elby argues that his 
proof has nontrivial implications, not for experiments, but for a conceptual dilemma it poses for the realist: either give 
up locality, or deny that conservation laws can hold etea approximately (ss in the case of CP violetion). For 
analysis and strengthening of Elby's argument whish supports his conclusion, of. Clifton®7}.) 
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correlations, andfurthermore forthe swyct correlations predicted by QM in general: in particular for 
|¥>'s. Second, 2 prrory there is already good reason to accept the existence of strict correlations. 
For they are grounded in fundamental conservation principles which we have reason to believe quite 
apart trom our reasons for believing in QM. 


Finally, itis worth noting that our quasi-algebraic inequality has in effect been independently 
discovered by Greenberger ev#/(5). (‘In effect'>because they derive, in a somewhat different way, 
a 3 particle version of ouc inequality by considering a simpler QM state of the type recently 
discovered by Mermin - cf. note 5.) As in our proof, Greenberger ef a/ drop QM's strict 
correlations, and so drop EPR's argument which can only be launched from these correlations. 
However, their derivation does assume what they call 'EPR's program’, which for them amounts to 
assuming the existence of deterministic result functions with no causal dependence on apparatus 
existents. In contrast, our derivation does not tie itself down to 'EPR's program’, in this sense, and 
s0 our inequality is capable of experimentally falsifying a weaker EPR program than the one they 
postulate, And we believe it reallyis weaker, despite the fact of Stappl9) and Finel!) having general 
proofs of the equivalence of stochastic and deterministic models (cited by Greenberger efa/. ). 


In brief, our reason is this. Recall:- Fine's reduction of a factorable stochastic model -i.e. a 
model which gives probabilitiesfor results A, B, etc. conditional on A and quantities a, b, etc. (i.e. 
pr(AA,a), pr(A,B/A,a,b), etc.) - to a deterministic model goes by defining the big joint 
p(A.A’,...,B,B’,.../a,a’,...,0,6’,...).in terms of the factorable stochastic model, as: 


Za pr(A/A,a)pe(A'A,a')...pe(B/A, b)pr(B'/A.,b’)... pet) (5.5) 


He then easily shows in the usual way that this big joint is equivalent to a deterministic model: eachA 
is a sequence of results <A,A',...,B,B’,...>, etc. Now, given the big joint, the deterministic model 
is uniquely fixed. But given the stochastic model, many big joints exist whose corresponding 
deterministic model delivers the same statistics as the stochasticmodel. (Fine's choice above has the 
particular property that at each A, any values A, B of any 2 quantities a, b are stochastically 
independent.) So one is at liberty to deny the joints’ physical reality - at least until someone gives an 
account of physically real probability which singles out just 1 big joint. (Cf. Butterfield!) for 
further discussion of this point.) 


6. CONCLUSION 


Our main contribution in this paper has been to provide aquite general measure-theoretically 
valid proof of nonlocality along the lines of Greenberger, Horne, and Zeilinger. Along the way we 
discussed a number of related issues, including the plausibility of our assumptions (especially in light 
of the EPR-Bohr debate), and the proof's relationship to other algebraic nonlocality proofs. We also 
discussed the advantages of our proof over Bell's, and derived a new testable quasi-algebraic locality 
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inequality. Although this last result is of practical interest, nevertheless in our view it is subordinate 
to section 3's proof, which invokes correlations that already have inductive confirmation, and are 
motivated by conservation principles. Webelievethis proof makes the case against local realism the 
strongest it has been since John Bell's ground-breaking work over 25 years ago. 


APPENDIX - INDEX OF ABBREVIATIONS USED 
AF Apparatus Factorability: — p(L/A,8)=P(HL4A,8)p(LB/A.,8 )p(tcA.,8 )p(.p/A.,8) 


AL Apparatus Locality: PiaA,8)=ptaA,Ba) ; P(tp/A,0)=p(tBA,6p); 
P(cR,8)=P(cR.8c) ; P(itD/A,8)=p(tpA.8p) 


EPR Einstein, Podolsky, and Rosen 
GHZ — Greenberger, Horne, and Zeilinger 
NSCRI NonStrict CorrelationRecovery Integral: 

LU (eT_P (eA. 1.8))owA,0 du ]o@/eyda = CC(@) VO with CC(@)e {0,1} 
OF QutcomeFactorability: P(E/A,(1,0) = P(ea/A,p,0) Plea, p.8) P(ec/A,p.8) P(enA.p..8) 
OF) _QutcomeFactorability, withprobabilities conditional oninitialexistents(Qo,tlo) 


OL Qutcome Locality: P(Eg/A,|,8) = P(ea/A,jta.Oa) ; P(ep/A,p.8) = Plea, ite.On) 
P(Ec/A,|L,0) = P(EcA, Hc. 8c) i P(Ep/A,p.,0) = P(Ep/A.,p.8p) 


PL ParticleLocality: p(8)=p(A) VA,Ee 


QM Quantum Mechanics 


TAF Total Apparatus Factorability: Conjunction of Apparatus Factorability, 
and Apparatus Locality 

TF Total Factorability: Conjunction of Outcome Factorability, 
and Outcome Locality 


SCRI — Strict CorrelationRecovery Lategral: 


| ACC ZeTeP(eA.p..0))pqua.eydu] paieyaa =+1 VOE 
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